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REPORT  SUMMARY 


In  Section  1 Baxter  and  Rushforth  review  the 
atmospheric  turbulence  deblurring  problem.  Based  on  this 
review,  a research  plan  is  being  developed  for  research  in 
the  area  . 

In  Section  2 Atashroo  reports  on  his  mathematical 
techniaues  for  modeling  the  "zeros"  in  synthetic  speech. 
This  work  is  continuing  so  these  preliminary  technical 
results  are  not  yet  conclusive. 

In  Section  3 Callahan  abstracts  the  finished  work  on 
Short  Time  Spectrum  Acoustic  Processing. 

In  Section  ^4  Ingebretsen  abstracts  his  work  on  two 
estimation  parameters  for  audio  signals. 

In  Section  5 Christiansen  abstracts  his  method  for 
recognizing  words  in  continuous  speech. 

In  Section  6 Peterson  and  Atashroo  discuss  a noise 
filtering  method  for  speech  signals. 

In  Section  7 Done  briefly  describes  an  attempt  at 
linear  predictive  coding  using  a glottal  waveform  model. 
This  approach  did  not  improve  speech  quality  sufficiently 
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Section  8 reports  an  attempt  to  use  more  waveform  phase 
information  in  a speech  synthesis  method. 
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SECTION  1 

REMOVING  ATMOSPHERIC  TURHULRNCE  Hl.UR 

Brent  Baxter  Craig  Rushforth 

1 . Introduction 

The  removal  of  the  effects  of  atmospheric  turbulence 
from  optical  images  is  a significant  problem  of  long 
standing.  Recent  investigations  by  Knox  and  Thompson  have 
led  to  the  development  of  a restoration  procedure  which 
shows  considerable  promise.  This  procedure  has  not  been 
successfully  applied  to  real  data  as  yet,  however,  nor  has 
it  been  sufficiently  well  analyzed  and  simulated  to  provide 
a thorough  quantitative  understanding  of  their  properties. 
Furthermore,  these  procedures  will  very  likely  require 
modification  before  they  can  be  practically  applied  to  large 
quantities  of  real  data.  We  have  begun  an  investigation  of 
Knox's  method  aimed  at  finding  suitable  ways  to  apply  it  to 
real  data  . 

1.1.  Pjgbleni  abatement 

One  of  the  most  difficult  and  serious  problems 
associated  with  high-resolution  optical  imaging  systems  is 
the  distortion  introduced  by  atmospheric  turbulence.  If 
images  produced  in  the  presence  of  atmospheric  turbulence 
are  simply  recorded  and  processed  by  conventional  methods, 
the  result  is  often  a loss  of  essential 
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high-spatial-frequency  detail.  This  problem  is  serious  even 
under  favorable  conditions  at  a good  site,  and  it  is  made 
worse  by  such  effects  as  wind,  solar  heating,  and 
boundary-layer  turbulence  near  a moving  object.  Much  of  the 
potential  resolving  power  of  a large  telescope  will  be  lost 
unless  the  received  optical  field  is  processed  and  recorded 
properly. 

There  now  exist  techniques  whose  potentials  for 
reducing  the  effects  of  atmospheric  turbulence  have  been 
demonstrated  theoretically  and  by  simulation.  However,  much 
work  remains  to  be  done  before  the  promise  of  these  or 
related  techniques  can  be  realized  with  real  data.  Foremost 
among  the  problems  which  remain  are  noise  sensitivity  and 
prohibitive  computational  complexity.  A third  important 
issue  is  related  to  an  assumption  which  is  explicit  or 
implicit  in  almost  all  work  published  to  date,  namely  that 
the  imaging  is  isoplanatic;  i.e.,  that  the  instantaneous 
point-spread  function  of  the  atmosphere-telescope  system  is 
the  same  for  all  points  in  the  object.  This  will  usually  be 
true  only  for  objects  of  very  limited  extent.  Thus, 
appropriate  extensions  will  have  to  be  considered  if  these 
techniques  are  to  be  applied  to  objects  larger  than  a single 
isoplanatic  patch. 

There  is  a final  issue  which  often  falls  to  receive  the 
attention  that  it  deserves,  namely  the  skilled  and  careful 
handling  and  recording  of  Images  using  proper  techniques  and 
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eq  11  i pmen  t . Using  the  wrong  type  of  film  or  developing 
process,  impropei’  gray  scale  compensation,  or  a careless 
scanning  and  digitizing  procedure  may  significantly  detract 
I'rom  the  effectiveness  of  a procedure  for  removing 
turbulence  blur.  Furthermore,  it  is  difficult  to  assess  the 
real  merit  of  a technique  whose  results  are  presented  using 
carelessly-prepared  photographs.  Unfortunately,  these 
problems  occur  more  often  than  one  might  expect. 

2 . Summary  of  Previous  Work 

In  this  section,  we  provide  a brief  summary  of  previous 
work  on  the  problem  of  restoring  Images  which  have  been 
blurred  by  atmospheric  turbulence.  We  make  no  attempt  to  be 
comprehensive,  but  rather  restrict  our  attention  to  work 
which  is  immediately  relevant  to  our  problem.  For  a more 
complete  discussion  of  other  work,  and  for  a comprehensive 
bibliography,  see  [1]. 

2.1.  Long-Time  Averaging 

The  problem  in  which  we  are  interested  is  characterized 
by  an  object  intensity  distribution  which  is  constant  for  a 
long  period  of  time,  during  which  the  conditions  of  the 
atmosphere  change  appreciably.  One  obvious  approach  to 
removing  the  effects  of  random  atmospheric  turbulence  is  to 
perform  a long-time  average  of  the  image  Intensity 
distribution.  This  type  of  averaging  can  be  effected  either 
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by  exposing  film  or  another  recording  medium  for  a long 
time,  or  by  averaging  many  short-time  exposures.  In  either 
case,  the  averaging  has  the  effect  of  removing  the  random 
fluctuations  from  the  resulting  image.  Figure  1 shows  such 
an  image. 

Unfortunately,  this  long-time  averaging  also  has  the 
effect  of  removing  or  greatly  attenuating  the  high  spatial 
frequencies  from  the  image  [11].  Thus,  although  the  average 
image  is  stable,  its  spatial-frequency  content  is  limited  by 
atmospheric  "seeing"  rather  than  by  the  imaging  system. 
Furthermore,  only  limited  improvement  can  be  achieved  by 
applying  standard  restoration  techniques  to  the  long-time 
average  image.  As  an  example  of  the  seriousness  of  this 
effect,  consider  the  200-inch  Hale  telescope.  The  diameter 
of  the  Airy  disk  for  this  telescope  is  about  0.05  arc 
seconds,  while  a typical  seeing  limit  on  a quiet  night  is 
about  2 arc  seconds.  Thus,  if  long-time  averaging  is 
employed,  atmospheric  turbulence  degrades  the  resolving 
power  of  this  system  by  a factor  of  about  40.  If 
high-resolution  Imaging  systems  are  to  reach  their  potential 
in  the  presence  of  atmospheric  turbulence,  then,  it  is  clear 
that  alternative  methods  which  yield  statistical  stability 
while  preserving  high  spatial  frequencies  are  required. 
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(a) 


(b) 


FIGURE  1 

Long  exposure  turbulence. 

(a)  Two  second  exposure  made  through  a questar  telescope. 

(b)  Attempts  to  recover  the  high  spatial  frequency 
information  result  some  improvement  accompanied  by 
amplification  of  film  grain  noise. 
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A partial  solution  to  this  problem  was  obtained  by 
l.abeyrie  [12,  1^],  whose  technique  has  come  to  be  known  as 
speckle  interferometry  because  of  the  speckled  character  of 
the  images  used.  An  example  of  such  an  image  is  shown  in 
Figure  5.  In  this  technique,  a series  of  short-time 
exposures  of  the  atmospherically-degraded  object  are  taken 
through  a narrow-band  spectral  filter.  The  exposure  time  is 
short  enough  that  the  atmosphere  is  effectively  "frozen" 
during  the  exposure,  and  successive  photographs  are 
sufficiently  separated  in  time  that  they  may  be  taken  to  be 
statistically  independent  for  the  purpose  of  averaging. 

Instead  of  simply  averaging  these  short-time  exposures 
directly,  a process  which  leads  to  a loss  of  high  spatial 
frequencies  as  we  have  seen,  speckle  interferometry  requires 
some  additional  operations  before  averaging.  First,  the 
Fourier  transform  of  each  image  is  taken,  either  optically 
or  digitally.  The  averaging  is  then  performed  on  the 
squared  magnitude  of  these  individual  Fourier  transforms. 

The  process  of  taking  the  square  of  the  magnitude 
destroys  the  information  regarding  the  phase  of  the  image. 
In  this  fact  lies  both  the  strength  and  the  weakness  of 
speckle  interferometry.  Since  it  is  the  random  phases  of 
the  h igh-spa t ia 1- frequency  components  which  cause  their 
mutual  cancellation  in  the  process  of  long-time  averaging, 
the  suppression  of  phase  prevents  this  cancellation  and  thus 
preserves  some  high-frequency  information  which  would  be 
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lost  under  direct  long-time  averaging.  Unfortunately, 
information  about  the  phase  of  the  object  is  also 
suppressed.  This  loss  of  phase  information  limits  the  class 
of  objects  to  which  speckle  interferometry  can  be  usefully 
applied.  For  example,  this  technique  can  yield  useful 
information  about  symmetric  objects  or  binary  star  systems, 
but  not  about  an  object  with  an  arbitrary  intensity 
distribution  or  arbitrary  shape. 

Labeyrie  and  his  colleagues  have  verified 
experimentally  that  speckle  interferometry  yields 
essentially  di f f r ac t ion - 1 im i ted  information  about  such 
objects  as  binary  star  systems.  In  addition,  theoretical 
discussions  of  this  technique  have  been  provided  by  Miller 
and  Korff  [1^]  and  by  Korff  [15].  These  authors  have  used 
models  of  propagation  in  the  turbulent  atmosphere  to  explain 
why  speckle  interferometry  works  and  to  generalize  the 
results  to  par tially-coherent  objects. 
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2.3.  The  Knox-Thompson  Technique 

The  success  of  speckle  interferometry  in  reducing  the 
effects  of  atmospheric  turbulence,  coupled  with  its  limited 
applicability,  has  spurred  a search  for  generalizations  and 
improvements  of  this  technique.  A significant  extension  of 
speckle  interferometry  was  developed  by  Knox  and  Thompson  in 
1974  [1].  This  new  procedure  uses  speckle  interferometry  as 
before  to  obtain  amplitude  information.  To  this  amplitude 
information  is  added  phase  information  obtained  by  averaging 
the  Fourier  transforms  of  the  short-time  exposures  in  a 
different  way. 

We  denote  the  Fourier  transform  of  the  short-exposure 

image  intensity  distribution  by  I(u).  The  quantity 

calculated  in  speckle  interferometry,  from  which  the 

2 

amplitude  information  can  be  obtained,  is  <|l(u)|  > , where 
the  pointed  brackets  denote  an  average  over  many  images. 
Knox  and  Thompson  use  this  same  procedure  to  obtain 
amplitude  information. 

Knox  and  Thompson  have  shown  that  differential  phase 
may  be  obtained  from  the  quantity  <I(Ui)l  (u^)^,  where  u-j 
and  U2  are  spatial  frequencies  differing  from  each  other  by 
only  a small  amount,  and  the  asterisk  denotes  the  complex 
conjugate.  Differential  phases  can  then  be  added  to  produce 
an  object  phase  distribution  which,  together  with  the  object 
amplitude  (from  speckle  interferometry),  characterize  the 
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object  intensity  distribution  out  to  the  diffraction  limit 
of  the  telescope.  This  technique  thus  offers  the  potential 
of  greatly  improved  resolution  for  objects  with  arbitrary 
intensity  distributions. 

Unfortunately,  the  effectiveness  of  the  Knox-Thompson 
technique  has  not  as  yet  been  demonstrated  on  real  data. 
Simulations  with  data  free  of  sensor  noise  performed  by  Knox 
[1],  by  ourselves,  and  by  others  [16]  have  led  to  results  in 
general  agreement  with  predictions,  however,  and  there  is 
reason  to  believe  that  the  technique  will  work  reasonably 
well  on  real  images,  provided  that  they  are  of  high  quality 
(noise  free)  and  that  there  are  enough  of  them  to  allow 
adequate  averaging.  There  are  indications  that  the 
technique  may  be  quite  sensitive  to  noise,  however  [2]. 
This  concern  has  led  to  a search  for  alternatives  which  are 
more  robust.  One  such  approach  is  discussed  in  the  next 
section  . 

Before  we  turn  to  that  discussion,  we  briefly  summarize 
the  results  of  a one-dimensional  simulation  of  the 
Knox-Thompson  technique  which  we  performed  here  at  the 
University  of  Utah.  Figure  2a  shows  a collection  of  sample 
functions  of  random  wavefront  phase  induced  by  atmospheric 
turbulence  according  to  the  physical  model  described  in  [1]. 
A typical  point  spread  function  of  the  atmosphere-telescope 
combination  is  shown  in  Figure  2b.  Figure  2c  shows  the 
intensity  distribution  of  the  object  to  be  imaged,  and 
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Kiguro  2(1  shows  the  blurred  Image  obtained  by  convolving 
Kigure  2c  with  Figure  2b. 

Figure  3a  shows  the  result  of  averaging  64  point  spread 
functions  similar  to  that  shown  in  Figure  2b,  Figures  3b 
and  3c  show  the  RMS  image  Fourier  transform  and  the  squared 
magnitude  of  the  object  Fourier  transform.  Figures  4a  and 
4b  illustrate  the  phase  reconstruction  process,  and  finally 
Figure  4c  shows  the  estimate  of  the  object  intensity 
distribution  obtained  using  the  Knox-Thompson  technique. 
Note  the  good  agreement  with  Figure  2c. 

The  simulation  described  above  did  not  include  the 
effects  of  sensor  noise.  Although  we  have  not  as  yet 
attempted  to  quantify  the  effects  of  sensor  noise  on  the 
performance  of  the  Knox-Thompson  method,  we  have  some 
preliminary  indications  that  tend  to  support  Sherman’s  [2] 
contention  that  these  effects  can  be  quite  severe.  These 
indications  were  obtained  during  a preliminary  attempt  to 
apply  the  Knox-Thompson  procedure  to  some  photographs  at  the 
Kitt  Peak  Observatory.  We  believe  that  our  failure  to 
achieve  satisfactory  results  in  this  preliminary  experiment 
was  due  in  large  measure  to  the  poor  quality  of  the  data, 
particularly  the  large  amount  of  sensor  noise  present. 


/f 


-a 


a • 

a 


(a) 


(b) 


• • 

7 • 

• • 

t • 

4 • 

J • 

a • 

I • 

• • 

eeaec 


' ■ 

j 

. i 

— 

1aJ_ 

A ^ 

?; 

1 

1 

1 

J 

y V V 

(c) 


FIGURE  4 

Phase  estimation  using  Knox's  method. 

(a)  Phase  differences  are  small  compared  with  2n  preventing 
any  ambiguity  In  the  averaging  process. 

(bl  Reconstructed  phase. 

(c)  Object  Intensity  distribution  estimated  by  computing 
the  Inverse  Fourier  transform  of  3(c)  and  4(b). 
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2 . U . Real-Time  Phase  Compensation 

An  entirely  different  approach  to  the  reduction  of 
atmospheric  turbulence  effects  is  to  try  to  deal  with  these 
effects  before  an  image  is  recorded  rather  than  after.  In 
many  circumstances,  the  effects  of  atmospheric  turbulence 
can  be  modeled  as  a phase  perturbation  of  the  received 
wavefront  at  the  imaging  aperture.  If  this  phase 
perturbation  were  known,  or  if  an  accurate  estimate  could  be 
obtained,  the  effects  of  turbulence  could  be  removed  by 
f^nase  compensation.  Several  investigators  have  considered 
the  possibility  of  Implementing  real-time  phase-compensation 
systems  [4,  6].  In  these  systems,  measurements  of  optical 
phase  perturbation  are  made  over  the  objective  pupil,  either 
explicitly  or  Implicitly,  with  the  result  being  used  to 
control  a deformable  mirror  or  some  other  phase-correcting 
device  to  compensate  for  atmospheric  turbulence  in  real  time 
before  the  image  is  recorded.  Such  systems  avoid  the  heavy 
burden  of  digital  computation  which  we  have  described  above 
in  connection  with  the  Knox-Thompson  and  Sherman  techniques, 
but  on  the  other  hand  they  require  more  complicated  optical 
equipment  and  control  systems.  Furthermore,  the  accuracy  of 
phase  compensation  systems  is  limited  by  phase-estimation 
errors  due  to  the  finite  wavefront-sensor  signal-to-noise 
ratio  and  by  fitting  errors  due  to  the  finite  number  of 
spatial  modes  restored  by  the  wavefront  corrector  [6]. 
Thus,  we  believe  that  further  extensions  and  Improvements  of 


both  phase  compensation  systems  and  post-processing 
techniques  of  the  type  we  have  discussed  above  should  be 
undertaken. 

2.5.  The  Work  of  Shapiro 

In  an  interesting  series  of  paper  [6,  9],  Shapiro  has 
established  what  appears  to  be  a very  useful  framework  for 
the  analysis  and  design  of  optical  imaging  systems  which 
attempt  to  remove  the  effects  of  atmospheric  turbulence.  In 
[9],  he  develops  a normal-mode  decomposition  for  the 
turbulent  atmosphere  which  is  similar  to  that  developed 
previously  for  free-space  imaging  by  Rushforth  and  Harris 
[17]  and  others.  He  then  applies  these  results  to  a study 
of  the  ultimate  performance  limits  on  imaging  through  a 
turbulent  atmosphere  [6,  8].  In  agreement  with  others,  he 
shows  that  it  is  possible  in  principle  to  achieve 
diffraction-limited  imaging. 

Finally,  Shapiro  in  [7]  considers  the  conditions  under 
which  diffraction-limited  imaging  may  be  achieved  even  when 
the  extent  of  the  object  is  such  that  the  imaging  is  no 
longer  Isoplanatlc;  i.e.,  when  the  atmospheric  point  spread 
function  depends  upon  the  position  of  the  point  source  in 
the  object  plane.  Since  all  the  work  discussed  in  Sections 
2.1  through  2.5  is  based  on  the  assumption  of  isoplanatlc 
imaging,  and  since  in  many  practical  situations  the  imaging 
will  not  be  isoplanatlc,  Shapiro's  results  are  of 
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considerable  interest  and  importance. 

In  effect,  what  Shapiro  shows  is  that  in  many  cases  of 
interest,  the  object  can  be  broken  up  into  distinct 
isoplanatlc  elements  such  that  the  contribution  of  each 
element  to  the  image  can  be  separated  from  the  others.  In 
principle,  this  enables  us  to  deal  with  each  isplanatlc 
element  separately  and  then  put  the  individual  images  back 
together  to  form  the  total  image.  In  practice,  this 
procedure  may  be  formidable  in  its  complexity,  but  it  is 
nevertheless  of  interest  to  know  that  conditions  frequently 
exist  under  which  it  is  possible  in  principle. 

In  the  process  of  obtaining  his  results  on 
nonisoplanatic  imaging,  Shapiro  [7]  develops  a model  for  the 
atmosphere  which  we  Intend  to  build  upon  in  our  work.  He 
points  out  that  the  problem  of  optical  imaging  through  a 
turbulent  atmosphere  shares  many  features  with  the  problem 
of  transmitting  and  receiving  information  at  radio 
frequencies  through  random  media  such  as  tropospheric 
scatter  channels.  In  fact,  he  shows  that  the  turbulent 
atmosphere  can  be  modeled  as  a wide-sense  stationary, 
uncorrelated  scatter  (WSSUS)  channel,  which  in  many  respects 
is  the  simplest  and  most  useful  scatter-channel  model  [10]. 
Shapiro  modifies  and  applies  the  results  of  [10]  to  show 
that  when  the  WSSUS  channel  is  underspread  (i.e.,  the 
delay-Doppler  product  la  less  than  one),  the  effects  of  the 
various  isoplanatlc  elements  can  be  separated  as  described 
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above.  This  condition  frequently  occurs  in  practice. 

Thus,  we  have  an  example  in  which  a model  from 
random-channel  communication  theory  has  proven  very  useful 
in  the  analysis  of  optical  imaging  in  atmospheric 
turbulence.  In  addition,  Shapiro  has  shown  that  this  model 
can  be  applied  to  optical  communication  systems  and  to 
speckle  interferometry.  We  believe  that  this  model,  with 
appropriate  modifications,  will  be  very  useful  in  the  work 
which  we  are  proposing.  We  shall  say  more  about  this  in  the 
next  section,  where  we  make  specific  proposals  to  utilize 
this  model . 

To  summarize,  we  have  described  in  this  section  several 
methods  which  have  the  potential  to  achieve  high-resolution 
imaging  in  the  presence  of  atmospheric  turbulence. 
Considerable  work  remains  before  this  potential  can  be 
realized  in  a practical  system,  however.  In  particular,  the 
issues  of  noise  sensitivity,  computational  complexity,  and 
non isoplana t ic  imaging  must  be  investigated  more  carefully. 
Since  sensor  noise  will  be  present  in  any  practical  system, 
it  is  essential  that  we  undertstand  how  this  noise  affects 
various  restoration  procedures,  and  that  we  know  how  to 
minimize  these  effects.  The  issue  of  computational 
complexity  may  determine  whether  a technique  is  hopelessly 
impractical  or  can  be  made  operational.  Finally,  a system 
which  is  to  be  of  use  in  many  practical  situations  must  be 
able  to  deal  with  large  objects  for  which  the  imaging  will 
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very  likely  be  nonisoplanatic . We  expect  the  communication 
theory  model  described  in  this  section  to  be  useful  in  these 
investigations  as  well  as  in  suggesting  alternative  or 
modified  procedures  with  improved  performance  and 
e f f ic iency  . 


FIGURE  5 


Speckle  image  of  Arcturus  made  on  the  Mayall  four-meter 
telescope  at  Kitt  Peak.  Delays  imposed  on  the  wavefront 
by  its  passage  through  the  atmosphere  cause  speckles  whose 
size  is  characteristic  of  the  diffraction  limit  of  the 
tel escope . 


Page  22 


REFERENCES 


[1]  Keith  T.  Knox,  "Diffraction-Limited  Imaging  with 
Astronomical  Telescopes",  Ph.D.  dissertation. 
Institute  of  Optics,  University  of  Rochester,  New 
York,  1975. 


[2]  James  W.  Sherman,  "A  Posteriori  Restoration  of 
Atmospherically  Degraded  Images  Using  Multiframe 
Imagery",  Topical  Meeting  on  Image  Processing, 
Optical  Society  of  America,  Pacific  Grove, 
California,  February  24-26,  1976, 


[3]  C.R.  Lynds,  S.P.  Worden,  and  J.W,  Harvey,  "Digital 
Image  Reconstruction  Applied  to  Alpha  Orionis", 
Astrophysical  Journal,  July  1,  1976. 


[4]  R.A.  Muller  and  A.  Buffington,  "Real-Time 
Correction  of  Atmospherically  Degraded  Telescope 
Images  through  Image  Sharpening",  Journal  of"  the 
Optical  Society  of  America,  September  1974,  pp. 
1200-1210 . 


[5]  A.M.  Schneiderman,  P.F.  Keller,  and  M.G.  Miller, 
"Laboratory  Simulated  Speckle  Interferometry", 
Journal  of  the  Optical  Society  of  America,  November 
1975,  pp.  1287-1291. 


[6]  J.H.  Shapiro,  "Propagation-Medium  Limitations  on 
Phase-Compensated  Atmospheric  Imaging",  Journal  of 
the  Optical  Society  of  America,  May  1976. 


[7]  J.H,  Shapiro,  "Diffraction-Limited  Atmospheric 
Imaging  of  Extended  Objects",  Journal  of  the  Optical 
Society  of  America,  May  1976,  pp.  469-477. 


[8]  J.H,  Shapiro,  "Optimum  Adaptive  Imaging  through 
Atmospheric  Turbulence",  Applied  Optics,  November 
1974,  pp.  2609-2613. 


Page  23 


[9]  J.H.  Shapiro,  "Normal-Mode  Approach  to  Wave 
Propagation  in  the  Turbulent  Atmosphere",  Applied 
Optics,  November  197*4,  pp.  26  1 *4-26  1 9 . 


[ 10] 


R.S.  Kennedy,  "Fading  Dispersive  Communication 
Channels",  John  Wiley  and  Sons,  Inc.,  1969. 


[11]  R.E.  Hufnagel  and  N.R.  Stanley,  "Modulation 
Transfer  Function  Associated  with  Image  Transmission 
through  Turbulent  Media",  Journal  of  the  Optical 
Society  of  America,  January  1964,  pp.  52-61. 


[12]  A.  Labeyrie,  "Attainment  of  Diffraction  Limited 
Resolution  in  Large  Telescopes  by  Fourier  Analyzing 
Speckle  Patterns  in  Star  Images",  Astronomy  and 
Astrophysics,  Vol.  6,  1970,  pp.  85-87. 


[13]  D.Y.  Gezari,  A.  Labeyrie,  and  R.V.  Stachnik, 
"Speckle  Interferometry:  Diffraction-Limited 

Measurements  of  Nine  Stars  with  the  200-Inch 

Telescope",  Astrophysical  Journal,  April  1,  1972,  pp. 
L1-L5. 


[14]  M.G.  Miller  and  D.  Korff,  "Resolution  of  Partially 
Coherent  Objects  by  Use  of  Speckle  Interferometry", 
Journal  of  the  Optical  Society  of  America,  February 
1974,  pp.  155-161. 


[15]  D.  Korff,  "Analysis  of  a Method  for  Obtaining 
Near-Diffraction-Limited  Information  in  the  Presence 
of  Atmospheric  Turbulence",  Journal  of  the  Optical 
Society  of  America,  August  1973,  PP.  971-980. 


[16]  "A  Recent  Result  in  the  Restoration  of 
Turbulence-Degraded  Imagery",  ESL,  Inc.,  Sunnyvale, 
California,  May  12,  1975. 


Page  24 


['7]  C.K.  Rushforth  and  R.W.  Harris,  "Restoration, 
Resolution,  and  Noise",  Journal  of  the  Optical 
Society  of  America,  April  1968,  pp.  539-545. 


[18]  J.W.  Goodman  and  J.F.  Belsher,  "Fundamental 
Limitations  in  Linear  Invariant  Restoration  of 
Atmospherically  Degraded  Images",  Technical  Note, 
Department  of  Electrical  Engineering,  Stanford 
University,  Stanford,  California,  1975. 


[19]  J.C.  Dainty  and  R.  Shaw,  "Image  Science",  Academic 
Press , 1 974  . 


[20]  R.S.  Kennedy,  "On  the  Atmosphere  as  an  Optical 
Communication  Channel",  IEEE  Transactions  on 
Information  Theory,  September  1968,  pp.  716-725. 


Page  25 


StCTlON  2 


POLE-ZtriO  hODHLlNG  USING  A UT OCOh R EL  AT  ION  PREDICTION 
M.AliAtashroo  Steven  E. boll 


Abstract 

This  paper  describes  a method  for  estimating  the  poles 
and  zeros  of  a rational  filter  whose  spectrum  matches  in  the 
linear  prediction  sense,  a given  prototype  spectrum.  The 
lilter  parameters  are  calculated  from  systems  of  linear 
equations  that  result  from  linear  prediction  on 
autocorrelation  functions.  It  is  shown  that  this  method  is 
equivalent  to  spectral  matching  by  inverse  filtering.  The 
matching  process  is  carried  out  in  three  steps.  In  step 
one,  the  standard  linear  prediction  analysis  procedure  is 
modified  to  account  for  zeros  present  in  the  matching 
filter.  The  denominator  coefficients  (pole-predictors)  are 
then  calculated  from  the  resulting  system  of  autocorrelation 
equations.  In  step  two,  the  spectral  contribution  to  these 
poles  is  removed  from  the  given  prototype  spectrum  by 
convoiution  yielding  a residual  spectrum.  In  step  three, 
the  numerator  coefficients,  ( zer o-pr ed ic t or s ) are  calculated 
by  first  matching  the  residual  spectrum  with  a high  order 
all  pole  spectrum,  then  matching  the  inverse  of  the  latter 


with  the  desired  all-zero  filter.  This 


work 


has 


been 
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expanded  and  published  in  technical  report  UTEC-CSc-76-27  1 , 
Atashroo,  M.  "Pole-Zero  Modeling  and  Its  Applications  to 
Speech  Processing",  U.  of  U.  Computer  Science  Dept. 
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Spectral  Matching  By  Inverse  Filtering 

The  spectrum  of  a rational  filter,  H(z)  can  be  matched 

2 2 

to  a given  spectrum  |S(a))|  by  requiring  that  when  |S(u))| 

is  filtered  by  the  inverse  filter,  H(z)  ^ , the  output 

spectrum  is  flat.  That  is,  the  filter  coefficients  of  H(z) 

are  computed  in  order  to  satisfy  the  requirement  that 

|S(a))  1^-  ^ ^ =1  (1  ) 

iH(aj)  r 

This  approximation  technique  is  called  spectral 
matching  by  inverse  filtering,  and  defines  the  frequency 
domain  interpretation  of  pole-zero  modeling  by 
autocorrelation  prediction. 

All-Pole  Spectral  Matching 

Applying  this  interpretation  to  the  approximation  of  a 

2 

given  power  spectrum  |S(ii))|  by  an  all-pole  spectrum, 
G^/  I A(u))  I ^ gives 

lS(a))l^lA(a.)l^  = gJ 


(2) 
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The  filter  coefficients  and  gain  are  computed 

in  order  to  satisfy  equation  2.  Expanding  gives 


S(-)  '*(-)  ■ 


(3) 


But 


|S(< 


R^(n) 


A(u)) 

.2 


a(n) 

G^h(-n) 

In  the  time  domain,  equation  (3)  implies 


A*(a)) 


autocorrelations 

predictors 


: scaled,  a 1 1 -pol e 
response 


R(n)*a(n)  = G^h(-n) , h{0)  = G^  (4) 


Since  the  impulse  response  is  causal,  values  for  the 
predictors,  {a(n)}  and  gain  G^  are  computed  from  the 
well-known  autocorrelation  analysis  equations: 


M 

I a(k)R(n-k)  =0  n > 0 

k = 0 

(5) 

M . 

I a(k)R(k)  = n = 0 

k»0 
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Pole-Zero  Spectral  Matching 

Consider  now  matching  a given  power  spectrum  |S((d)|^  by 
the  spectrum  of  a rational  filter  H(z),  having  both  poles 
and  zeros: 


H(z) 


I b(  i )z 

i = 0 

M 


I a(i  )z 

i=0 


- i 


- i 


(6) 


Applying  the  inverse  filtering  interpretation  requires  that 
the  filter  coefficients  {a(i)}  and  {b(i)}  be  computed  such 
that 


|S(u))|^ 


|A(a))|^ 


(7) 


Expanding  equation  (7)  and  examining  the  time  domain 
relation  gives 


|S(a))  |^A(a,)  = B(a))  = B(a))H*(u))  (8) 


Thus 


R(n)*a(n)  = b(n)*h(-n) 


(9) 
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or 

M L 

R(n)  « -I  a(k)R(n-k)  + [ b(k)h(k-n)  all  n (10) 
k-1  k=0 

For  n > L the  presence  of  the  zeros  in  equation  (10)  is 
removed  by  the  causality  of  h(n),  thus 

M 

R(n)  - -I  a(k)R(n-k)  n > L (11) 

k-1 

The  relation  given  in  equation  (11)  is  referred  to  as 
the  Generalized  Linear  Prediction  normal  equations,  which 
are  solved  to  obtain  the  all-pole  predictors  f a ( k ) } • 
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Pole  hemoval  by  Convolution 

before  estimating  the  zeros  of  H(z),  the  effect  of  the 

approximated  poles  as  defined  by  the  filter  A(z)  is  removed 

2 

from  lS(a))|  by  convolution. 

Define 

|D(u))|^  - |S(a.)|^|A(a,)|^ 

then 

Rp(n)  - Rg(n)*R^(n) 

where 

RgCn)  --  |SCa.)|^ 

R;^(n)  |A(a.)|^ 

M 

RxCn)  - I a(k)a(k*n) 

* k-0 

2 

In  a somewhat  simplified  sense  |S(a))|  can  be  described 

as  being  composed  of  its  spectral  poles  and  zeros.  With  the 

2 2 

removal  of  the  all-pole  match  to  |S((ij)|  by  l/|A(u))l  , the 

2 

spectrum  |D(a))|  can  be  considered  as  the  residual  spectrum 
containing  spectral  zero  information.  These  spectral  zeros 
are  to  be  matched  by  the  all-zero  filter,  b(z). 


(12) 


(13) 
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All-Zero  Approximation  to 


The  all-zero  approximation  is  done  in  two  steps,  first 

2 

lD(ci))|  is  approximated  by  a high  order  all-pole  filter 

Gg/Q(z).  Second,  after  matching  |D((i))|  by  GQ/|Q(a))  , the 

all-zero  filter  B(z)  is  computed  by  matching  jB(a))|^  to  the 

2 2 2 

approximated  inverse  of  |D(aj)|  given  by  |Q(aj)|  /Gg. 

To  obtain  the  coefficients  of  Q(z),  the  inverse 
filtering  relation  is  applied.  Thus 


Using  the  same  procedure  as  with  standard  Linear 
Prediction,  the  analysis  equations  reduce  to 

N 

I Mn-k)q(k)  - 0 
k-0  ^ 

Jg  * Cq 

Finally,  the  all-zero  predictors  {b|^}  are  computed 
using  the  relation 


The  analysis  equation  is  given 


L 

7 b(k)yn-k)  - 0 


bCkjR^Ck)  " g| 


where 


N 

RoCk)  " I q(k)q(n+k) 
n*0 
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Discussion 

As  is  shown  in  the  analysis  block  diagram  (Figure  1), 
the  analysis  method  uses  standard  linear  prediction  to 
estimate  the  zeros  and  so-called  generalized  linear 
prediction  to  estimate  the  poles.  Unfortunately,  with  this 
generalized  method,  stability  of  the  filter  cannot  be 
guaranteed.  There  are  at  least  two  options  for  guaranteeing 
stability,  one,  estimate  the  pole-predictors  with  standard 
linear  prediction  analysis  or  two,  initially  match  the 
pole-zero  filter  to  the  inverted  spectrum,  then  use  the 
reciprocal  of  the  resulting  filter. 

Some  of  the  advantages  of  this  method  for  spectral 
matching  are  that  the  analysis  is  carried  out  in  the  time 
domain  requiring  only  the  solution  of  linear  equations. 
More  important,  since  linear  prediction  can  be  used,  not 
only  can  both  poles  and  zero  be  optimally  coded  using 
reflection  coefficients,  but  the  filter  can  be  implemented 
by  simply  cascading  a normalized  lattice  filter  with  an 
inverted  normalized  lattice  filter. 
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SECTION  3 

Short  Time  Spectrum  Acoustic  Processing 

Michael  Wayne  Callahan 


The  frequency  domain  representation  of  a time  signal 
afforded  by  the  Fourier  transform  is  a powerful  tool  in 
acoustic  signal  processing.  The  usefulness  of  this 
representation  is  rooted  in  the  mechanisms  of  sound 
production  and  perception.  Many  sources  of  sound  exhibit 
normal  modes  or  natural  frequencies  of  vibration,  and  can  be 
described  concisely  in  the  frequency  domain.  The  human 
auditory  system  performs  frequency  analysis  early  in  the 
hearing  process,  so  perception  is  often  best  described  by 
frequency  domain  parameters. 

The  published  technical  report  UTEC-CSc-7 6-209  , March 
1976,  "Acoustic  Signal  Processing  Based  on  the  Short  Time 
Spectrum"  by  M.  Callahan,  U.  of  U.  Computer  Science 
Dept.,  investigates  a new  approach  to  acoustic  signal 
processing  based  on  the  short-time  Fourier  transform,  a 
two-dimensional  representation  which  shows  the  time  and 
frequency  structure  of  sounds.  This  representation  is 
appropriate  for  signals  such  as  speech  and  music,  where  the 
natural  frequencies  of  the  source  change  and  timing  of  these 
changes  is  important  to  perception.  The  principal  advantage 
of  this  approach  is  that  the  signal  processing  domain  is 
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similar  to  the  perceptual  domain,  so  that  signal 
modifications  can  be  related  to  perceptual  criteria. 

The  mathematical  basis  for  this  type  of  processing  is 
developed,  and  four  examples  are  described:  removal  of 
broadband  background  noise,  isolation  of  perceptually 
important  speech  features,  dynamic  range  compression  and 
expansion,  and  removal  of  locally  periodic  interfering 
signals. 


SECTION  4 
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Log  Spectral  Estimation  for  Stationary 

and  Nonstationary  Processes 
Robert  B.  Ingebretsen 


This  research  is  concerned  with  two  log  spectral 
estimators  in  the  context  of  both  stationary  and 
nonstationary  signals.  They  differ  because  in  one, 
smoothing  is  realized  before  the  logarithmic  transformation, 
while  the  other  is  smoothed  in  the  logarithmic  domain.  It 
is  shown  that  for  stationary  signals  the  two  estimators  are 
similar,  differing  in  expected  value  by  only  a universal 
constant.  The  first  estimator,  however,  is  smoother.  For 
nonstationary  signals,  the  estimators  are  biased  by 
different  amounts  dependent  upon  the  nonstationarity.  The 
difference  between  the  estimators  is  shown  to  be  a sensitive 
test  for  nonstationarity.  The  estimators  are  used  in  the 
analysis  and  implementation  of  two  solutions  to  the  problem 
of  blind  deconvolution.  It  is  found  that  the  methods  are 
equivalent  for  stationary  signals,  but  differ  markedly  for 
nonstationary  signals  in  the  presence  of  stationary 
background  noise.  Recommendations  are  made  for  the 
practical  digital  implementation  of  the  log  spectral 
estimators.  This  research  has  been  published  in  the 


technical  report  UTEC-CSc-75- 1 1 8 , August  1 976  , "Log  Spectral 


Kstlmation  for  Stationary  and  Nonstationary  Processes"  by  R. 
Ingebretsen,  U.  of  U.  Computer  Science  Dept. 
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SECTION  5 

Word  Recognition  in  Continuous  Speech 
Using  Linear  Prediction  Analysis 

Richard  W.  Christiansen 

A promising  method  of  automatic  word  recognition  in 
continuous  speech,  recently  designated  as  word  spotting,  has 
been  demonstrated.  The  method  uses  error  residual  ratios 
from  LPC  (Linear  Predictive  Coding)  vocoder  analysis  for 
^ waveform  comparison  and  a dynamic  programming  procedure  for 

time  registration  between  the  incominc-  speech  and  a template 
of  the  key  word.  Using  a similarity  threshold,  the  incoming 
speech  is  compared  witii  several  templates  to  account  for 
variability  in  spectral  shape.  This  system  can  work  in  real 
time  using  a real  time  vocoder. 

The  multiple  templates  are  used  in  such  a way  that  a 
small  number  of  templates,  three  or  four,  is  made  to  look 
like  several  hundred  or  more.  This  is  accomplished  by 
dynamically  constructing  a composite  template  from  parts  of 
each  single  template  as  part  of  the  processing  of  the 

incoming  speech.  Thus  a particular  composite  template  is 
constructed  for  each  word  being  recognized. 
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An  accuracy  of  99  percent  with  no  false  alarms  was 
achieved  using  205  key  words,  five  different  speakers,  and 
approximately  ten  minutes  of  speech  text.  Performance  in 
the  presence  of  additive  white  Gaussian  noise  of 
approximately  IldB  slgnal-to-noise  ratio  was  66  percent. 
When  the  speech  was  processed  to  account  for  the  noise, 
results  improved  to  85  percent  to  90  percent  accuracy. 
Finally  a digit  recognition  experiment  was  performed  using 
over  1200  digits  spoken  by  ten  different  people  with  a 
resultant  accuracy  of  97  percent.  This  research  has  been 
published  in  the  technical  report  UTEC-CSc-7  6-226  , August 
1976,  "Word  Recognition  in  Continuous  Speech  using  Linear 
Prediction  Analysis"  by  R.  Christiansen,  U.  of  U. 
Computer  Science  Dept. 
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SECTION  6 

Dynamic  Filtering  of  Degraded  Speech 
Using  Autocorrelation  Prediction 

M.AllAtashroo  TracyPetersen 


Abstract 

A dynamic  smoothed  wiener  filter  is  estimated  from  the 
degraded  speech.  Then,  this  dynamic  smoothed  wiener  filter 
is  approximated  to  desired  accuracy  with  a time-varying 
rational  filter  using  autocorrelation  prediction  [1].  This 
time-varying  rational  filter  is  specified  with  two  sets  of 
parcoi  parameters,  and  is  realized  as  a cascade  of  two 
lattice  filters.  Finally,  the  degraded  speech  is  filtered 
by  this  realization.  This  work  has  been  expanded  and 
published  in  technical  report  UTEC-CSc-7 6-27 1 , 1976,  "Pole 

Zero  Modeling  and  its  Applications  to  Speech  Processing",  U. 
of  U.  Computer  Science  Dept. 
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Introduction 

The  speech  is  degraded  with  additive  stationary  colored 

noise.  Assuming  the  speech  itself  is  stationary  during 

short  intervals  (T  = 30  msec),  a smoothed  wiener  filter 

W(kT,ii')  is  estimated  for  each  of  these  intervals.  The 

spectrum  of  a rational  filter  H(kT,ai)  is  then  matched  to  the 

2 

l/lW(kT,u))|  using  Autocorrelation  Prediction  [1].  Finally, 
the  rational  filter  l/H(kT,w)  is  used  to  filter  the 
corresponding  interval  of  the  degraded  speech. 

Estimation  of  the  Smoothed  Wiener  Filter 


The  smoothed  wiener  filter  for  each  interval  T is 
estimated  from  the  following  equation. 

|W(kT,u.)|=  1 - (1) 

where,  4>jyi(w)  is  an  estimate  for  the  smoothed  spectrum  of 
the  stationary  noise  and  $g^j^(kT,a))  is  an  estimate  for  the 
smoothed  spectrum  of  the  degraded  speech  during  the  kth 
interval.  is  obtained  by  averaging  the  short-time 

Linear  Prediction  (LP)  spectrum  of  the  noise  - the  silent 
portions  of  the  degraded  speech  during  the  whole  passage. 

kT, (A) ) is  obtained  by  averaging  the  short-time  LP 
spectrum  of  the  degraded  speech  during  the  kth  interval. 
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Spectral  Matching  of  Smoothed  Wiener  Filter 

Equation  1 shows  that  the  smoothed  wiener  filter  has 

flat  pass  bands  for  those  frequencies  where  the  speech 

dominates  the  noise,  and  has  deep  stop  bands  at  those 

frequencies  where  the  noise  dominates  the  speech.  In  the 

case  where  speech  dominates  the  noise  at  moat  of  the 

frequencies,  this  suggests  to  match  the  valleys  of  the 

smoothed  wiener  spectrum  first.  This  can  be  accomplished  by 

matching  the  spectrum  of  a rational  filter  H(kT,u)  to  the 

2 

l/|W(kT,a))|  using  Autocorrelation  Prediction.  Then,  the 
rational  filter  l/H(kT,a))  is  used  to  filter  the  kth  interval 
of  the  degraded  speech.  In  this  way,  the  computation  and 
the  number  of  parameters  of  the  matching  rational  filter 
H(kT,u))  is  reduced  considerably  from  an  all-pole  match  to 
|W(kT,u))|^. 

While  the  stability  may  not  be  guaranteed  for  the 
rational  filter  H(kT,(D),  it  is  guaranteed  for  l/H(kT,(jj)  [1]. 

Implementation  and  Results 

The  rational  filter  1/H(kT,a))  is  identified  by  two  sets 
of  the  parcor  parameters  obtained  from  the  Autocorrelation 
Prediction.  The  l/H(kT,u))  is  realized  as  the  cascade  of  two 
lattice  filters  - inverse  and  all-pole.  To  improve  the 
filtering  process  both  sets  of  the  parcor  parameters  are 
interpolated  for  each  sample  of  the  input  degraded  speech. 


Page  '*7 


REFERENCES 


[1]  M.  All  Atashroo,  S^F.  Boll,  "Pole-Zero  Modeling 
Using  Autocorrelation  Prediction",  presented  at  1976 
Arden  House  Workshop. 


Page  48 


SECTION  7 


Linear  Predictive  Coding  With  a Glottal  Waveform  Model 


William  Done 


Speech  analysis  using  the  linear  prediction  technique, 
based  on  the  approximation  of  the  nth  speech  sample  s(n)  as 
a summation  of  the  N previous,  linearly-weighted  samples, 
achieves  good  spectral  matching  to  the  poles  in  the  speech 
spectrum.  Zeros  of  the  spectrum  are  not  matched  as  well. 


To  better  model  the  zeros,  it  is  assumed  that  the  error 
signal 


e(n)  = a(n)  - s(n) 

N 

= s(n)  - ^ a(i)s(n-i) 

i=l 

could  be  modelled  as  the  effect  of  the  glottal  pulse  on  the 
zeros  of  the  vocal  tract.  That  is 

N 

s(n)  = I a(i)s(n-i)  + e(n) 
i=l 
N 

= y a(i)s(n-i)  + I b(j)g(n-j), 
i=l  j=0 

where  the  b(J)  are  the  zero  coefficients  and  g(n)  is  an 

assumed  glottal  waveform  model.  The  analysis  procedure  is 

summarized  as  follows:  determine  the  {a(i)}  sequence  using 

the  covariance  technique,  calculate  the  error  signal, 

approximate  the  error  signal  by  calculating  the  {b(j)> 

M 

sequence  so  that  ^ b(J)g(n-J)  approximates  e(n)  in  the 

j-0 
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least  squares  sense.  Synthesis  uses  the  coefficient 
sequences  {a(i)]  and  {b{j)}  and  the  assumed  glottal  waveform 
to  calculate 

N . M 

s(n)  = I a(i)s(n-i)  + I b(j)g(n-j) 
i=l  j=0 

Models  for  g(n)  were  based  on  results  by  Rosenberg  [1] 
and  Holmes  [2]  - a polynomial-based  model  and  a triangular 
model.  Square  and  triangular  doublets  were  also  used. 

Initially,  the  vocoder  structure  for  this  system  was  a 
fixed-frame  size  for  analysis  and  synthesis.  This  led  to 
amplitude  modulation  of  the  speech  envelope  and  poor  quality 
synthetic  speech.  Changing  to  a pitch  synchronous  system 
for  analysis  and  synthesis  improved  that  aspect  of  the 
synthesis,  but  it  was  evident  that  the  synthetic  speech  was 
still  degraded  by  spectrum-matching  errors.  Experimental 
data  showed  that  even  with  the  best  glottal  waveform  model 
found,  the  triangular  doublet,  the  synthesis  quality  was 
fairly  insensitive  to  the  number  of  zeros  for  a moderate 
number  of  zeros  (i20). 

Because  the  goals  of  research  Included  improvement  of 
speech  quality  while  maintaining  reasonable  bit  rates,  this 
project  was  terminated. 
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SECTION  8 


Speech  Synthesis  Using  the  Complex  Cepstrum 


The  phase  of  vowels  is  computed,  unwrapped  and 
incorporated  into  speech  analysis  - synthesis  technique. 
This  task  is  to  create  synthetic  speeches  containing  the 
phase  information  of  the  original  human  speech.  The  careful 
comparison  between  "phased"  and  "phaseless"  synthetic 
speeches  could  reveal  certain  insight  to  the  effect  of  phase 
on  the  vocoder. 

The  log  magnitude  is  obtained  by  taking  the  DFT  of  the 
analysis  data  buffer  as  in  an  ordinary  real  cepstrum 
homomorphic  vocoder.  However,  to  avoid  the  difficulty  of 
unwrapping  the  phase  of  a periodic  waveform,  the  unwrapped 
phase  is  obtained  by  unwrapping  the  phase  of  an  isolated 
wavelet  which  serves  as  a prototype  wavelet  for  the  adjacent 
waveform  within  this  data  buffer.  This  unwrapped  phase  can 
be  visualized  as  if  it  were  obtained  through  the  process  of 
taking  the  DFT  of  this  data  buffer,  which  includes  several 
pitch  periods,  unwrapping  and  then  smoothing. 

As  far  as  analysis  of  speech  is  concerned,  the  log 

magnitude  and  the  "smoothed"  phase  is  then  taken  as  the  real 
and  the  imaginary  part  to  calculate  the  complex  cepstrum. 
The  impulse  response  results  easily  after  cepstrum 


filtering. 
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Figure  1 gives  an  example  of  an  original  vowel 
waveform,  and  the  corresponding  synthetic  impulse  response 
and  synthetic  speech  obtained  by  this  method. 


SECTION  9 
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Image  Understanding 

Martin  Newell 

The  work  in  image  understanding  is  progressing  on  all 
four  tasks  outlined  in  the  previous  semi-annual  report. 
However,  no  single  area  has  progressed  to  the  stage  where 
significant  results  are  available.  However,  no  fundamental 
difficuties  have  been  uncovered  so  that  at  the  end  of  the 
next  six  months  we  expect  to  have  all  the  necessary  pieces 
of  the  system  working  together  on  the  PDP-11/H5  facility. 
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